skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Krstić, Miroslav"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. This paper studies the distributed feedback optimization problem for linear multi-agent systems without precise knowledge of local costs and agent dynamics. The proposed solution is based on a hierarchical approach that uses upper-level coordinators to adjust reference signals toward the global optimum and lower-level controllers to regulate agents’ outputs toward the reference signals. In the absence of precise information on local gradients and agent dynamics, an extremum-seeking mechanism is used to enforce a gradient descent optimization strategy, and an adaptive dynamic programming approach is taken to synthesize an internal-model-based optimal tracking controller. The whole procedure relies only on measurements of local costs and input-state data along agents’ trajectories. Moreover, under appropriate conditions, the closed-loop signals are bounded and the output of the agents exponentially converges to a small neighborhood of the desired extremum. A numerical example is conducted to validate the efficacy of the proposed method. 
    more » « less
    Free, publicly-accessible full text available December 16, 2025
  2. This paper proposes a novel learning-based adaptive optimal controller design method for a class of continuous-time linear time-delay systems. A key strategy is to exploit the state-of-the-art reinforcement learning (RL) techniques and adaptive dynamic programming (ADP), and propose a data-driven method to learn the near-optimal controller without the precise knowledge of system dynamics. Specifically, a value iteration (VI) algorithm is proposed to solve the infinite-dimensional Riccati equation for the linear quadratic optimal control problem of time-delay systems using finite samples of input-state trajectory data. It is rigorously proved that the proposed VI algorithm converges to the near-optimal solution. Compared with the previous literature, the nice features of the proposed VI algorithm are that it is directly developed for continuous-time systems without discretization and an initial admissible controller is not required for implementing the algorithm. The efficacy of the proposed methodology is demonstrated by two practical examples of metal cutting and autonomous driving. 
    more » « less
    Free, publicly-accessible full text available January 1, 2026
  3. null (Ed.)
    Abstract Control Barrier Functions (CBFs) have become popular for enforcing — via barrier constraints — the safe operation of nonlinear systems within an admissible set. For systems with input delay(s) of the same length, constrained control has been achieved by combining a CBF for the delay free system with a state predictor that compensates the single input delay. Recently, this approach was extended to multi input systems with input delays of different lengths. One limitation of this extension is that barrier constraint adherence can only be guaranteed after the longest input delay has been compensated and all input channels become available for control. In this paper, we consider the problem of enforcing constraint adherence when only a subset of input delays have been compensated. In particular, we propose a new barrier constraint formulation that ensures that when possible, a subset of input channels with shorter delays will be utilized for keeping the system in the admissible set even before longer input delays have been compensated. We include a numerical example to demonstrate the effectiveness of the proposed approach. 
    more » « less